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Abstract 

The issue of how to create open-ended evolution in an arti- 
ficial system is one the open problems in artificial life. This 
paper examines two of the factors that have some bearing on 
this issue, using the Tierra artificial life system. 

Parsimony pressure is a tendency to penalise more complex 
organisms by the extra cost needed to reproduce longer geno- 
types, encouraging simplification to happen. In Tierra, par- 
simony is controlled by the SlicePow parameter. When 
full parsimony is selected, evolution optimises the ancestral 
organism to produce extremely simple organisms. With par- 
simony completely relaxed, organisms grow larger, but not 
more complex. They fill up with "junk". This paper looks at 
scanning a range of SlicePow from 0.9 to 1 to see if there 
is an optimal value for generating complexity. 

Tierra (along with most ALife systems) use pseudo random 
number generators. Algorithms can never create information, 
only destroy it. So the total complexity of the Tierra system 
is bounded by the initial complexity, implying that the indi- 
vidual organism complexity is bounded. Biological systems, 
however, have plenty of sources of randomness, ultimately 
dependent on quantum randomness, so do not have this com- 
plexity limit. Sources of real random numbers exist for com- 
puters called entropy gatherers — this paper reports on the 
effect of changing Tierra's pseudo random number generator 
for an entropy gatherer. 



Introduction 

The issue of how to create open-ended evolution in an arti- 
ficial system is one the open problems in artificial life. This 
paper examines two of the factors that have some bearing on 
this issue, using the Tierra artificial life system(Ray, 1991 1 1 . 
Tierra is well known artificial system, and well described 
in the literature, so only brief details will be given here. 
The digital organisms in Tierra consist of self-replicating 
codes written in a specially designed machine language. The 
Tierra environment is a virtual operating system executing 
the organism's code in a time shared manner. Evolution pro- 
ceeds through deliberately introduced mutations, copying 
errors and instruction flaws. Organisms compete for CPU 
time and memory space (called soup). 

'Available from http://www.his. atr.jprray/tierra/ 



Parsimony pressure is a tendency to penalise more com- 
plex organisms by the extra cost needed to reproduce longer 
genotypes, encouraging simplification to happen. In Ray's 
earliest experiments with Tierra, CPU time was allocated 
evenly between organisms, favouring organisms with the 
shortest genomes. The time sharing system was changed 
so that CPU time was allocated proportional to £ sllcePoM . 
When SlicePow=0, we have the original maximal parsi- 
mony pressure. When SlicePow=l, parsimony pressure 
is removed entirely. In this case, organism length rapidly 
increases, until the soup consists of one organism whose 
length is greater than half of Tierra's memory. At this point, 
it can no longer reproduce, and the soup dies (simulation 
stops). 

But do organisms get more complex? For this pur- 
pose, we define complexity to be the algorithmic informa- 
tion ( |Li and Vk anyi, 1997} of the organism. Adami(1998) 
introduced this measure in an artificial life setting, and I 
(Standish, 1999! developed a technique for measuring this 
in the Tierra setting. In (Standish, 20031, I report the first 
detailed study of a Tierra run. Whilst organisms get longer, 
their complexity shows no sign of increase at all. Their 
length comes from adding "junk" into their genomes. 

Obvious neither extremes of SlicePow leads to com- 
plexity growth, but what if we tuned the parsimony pres- 
sure to modest values? In previous experiments, I knew that 
SlicePow< 0.9 led to shorter genomes, not longer, so in 
this paper, I scan a range of SlicePow from 0.9 to 1 to see 
if there is an optimal value for generating complexity. 

Tierra (along with most ALife systems) use pseudo ran- 
dom number generators. Pseudo random number genera- 
tors are short algorithms satisfying certain statistical tests 
for uniformity and independence. However, being the prod- 
uct of an algorithm, the output of a pseudo random number 
generatoris notrandomby definition (Li and Vita nyi, 1997) . 
Algorithms can never create information, only destroy it. 
The complexity of any sequence of numbers is closely re- 
lated to the length of the shortest algorithm that produces 
it, so the total complexity of the Tierra system with pseudo 
random number generators is bounded by its initial com- 



plexity, implying that the individual organism complexity 
is bounded. Biological systems, however, have plenty of 
sources of randomness, ultimately dependent on quantum 
randomness, so do not have this complexity limit. 

The only way an algorithm can generate unbounded com- 
plexity is if it is coupled to a source of real randomness — a 
random oracle. Random oracles feature in Douglas Adams's 
description of the infinite improbability drive: 

a Bambleweeny 57 Sub-meson Brain coupled to an 
atomic vector plotter suspended in a strong Brow- 
nian Motion producer (say a nice hot cup of tea) 
( |Adams, 19791 Chapt. 10). 

It turns out to be simple enough to create random or- 
acles: Geiger counters attached to radioactive sources 
( |Gude, 198 5 I 2 and Lava lamps 3 are available through the in- 
ternet to provide sources of genuine randomness, however 
these sources are limited to about 30 bytes per second. 

Computers themselves have many different sources of 
random data available. They often interact with the exter- 
nal environment (eg users using keyboards, mice etc), and 
there is a small amount of randomness in timings in the hard 
disk (Jakobssonet al., 1998 1. Programs that harvest these 
sources of physical randomness are called entropy gatherers . 
Since unpredictability is important for cryptographic appli- 
cations, practical true random number generation has expe- 
rienced a lot of development in recent years. For example, 
the Linux operating system includes an entropy gatherer in 
its kernel, available as a character device at / dev/ random. 

Unfortunately, entropy gatherers, like the internet avail- 
able random oracles, tend to be slow producers of ran- 
domness. HAVEGE ( |Seznec a nd Sendrier, 2003 1 4 exploits 
many different sources of entropy with a modern computing 
system using hand crafted assembly language routines to in- 
crease the rate of entropy production by 3-4 orders of magni- 
tude over the techniques available in /dev/random. This 
random stream is then used to seed a lookup table accessed 
by a pseudo random number generator to produce random 
numbers at similar rates to traditional pseudo random num- 
ber generators. The entropy of the resulting sequence is less 
than a truly random sequence, but considerably higher than 
a pseudo random generator. 

In this paper, we replace the pseudo random number gen- 
erator in Tierra by calls to the HAVEGE generator, and com- 
pare what difference this makes to growth of complexity. 

Measurement of Complexity in Tierra 

The most general definition of complexity of an object in- 
volves two levels of description, a m/cro-description which 
is its implementation, and a macro-description which is the 



2 http://www.fourmilab.ch/hotbits/ 

3 http://www.lavarnd.org/ 

4 http://www.irisa.fr/caps/projects/hipsor/HAVEGE.html 



abstract meaning of an object. More than one microdescrip- 
tion can correspond to the same macrodescription. If (o(i,x) 
is the number of microdescriptions of length £ correspond- 
ing to macrodescription x, then the complexity of x is given 
by flStandish, 2001) : 



C(x) = lim^log/V-logco(£,x). 



(1) 



where N is the size of the alphabet used for the microdescrip- 
tion. Eq Q converges extremely rapidly for £ > C(x)/logN. 
The base of the logarithm determines what units you are 
measuring complexity in — if it is base 2, then the units 
are bits. For convenience, in this paper we will use base 32, 
corresponding to the alphabet size of the Tierra instruction 
set. Complexity is then measured in instructions. In order to 
measure the complexity of an organism in Tierra, we simply 
need to count up the number (i)(£,x) of computer programs 
of length £ that are equivalent to a given digital organism x. 

Not so simple! The first problem is how to determine if 
two computer programs are equivalent. The technique we 
use ( Standish, 2003 1, is to record the results of a tournament 
where an organism is pitted pairwise against all genotypes 
recorded from a given Tierra run. Since this includes the 
context that these organisms experienced, any difference be- 
tween two organism is expected to show up as a difference 
in the results of the two tournaments. 

The second problem is that the number of programs of 
length £ is 32 , a computationally infeasible number. In 
( Standish, 2003 1, I show that an alternative measure C ss is 
a good first order estimate of the organismal complexity: 



L-Si 



E lQ g 



32 k, 



(2) 



1=1 



where n,- is the number of mutations at site i on the genome 
that lead to differing pheno types. 

This quantity is now very tractable, with a complete anal- 
ysis of a 10 10 timestep Tierra run taking only a few hours 
on ac3's Linux cluster — comparable to the time taken to 
perform the original Tierra run. 5 

Results 

Tierra was run with SlicePow=0.9,0.9L. . 1.0 for 10 10 
timesteps (instructions executed), with a soupsize of 
131072, both with the original random number generator, 
and with HAVEGE. When SlicePow=l, the runs termi- 
nated early due to the soup dying. Figure[^shows the results 
for SlicePow=0.96 using the Havege generator, which 
produced the maximum complexity of any run. 

Figure |2] shows the maximum value of C ss recorded for 
each run, as a function of parsimony pressure. It is showing 
fairly clearly that a SlicePow value between 0.95-0.96 is 



5 The analysis code is available from 

http://parallel.hpc.unsw.edu.aU/getaegisdist.cgi/getdeltas/eco-tierra.3 
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Figure 1: Plot of length and C s s for the 36 unique phenotypes created during the run with SlicePow=0.96, and the HAVEGE 
entropy source. Three organisms appear with complexity greater than the ancestral organism 0080aaa within the first fifty 
million timesteps, including one with a complexity nearly twice that of the ancestor, and one appears at 4.1 x 10 9 after which 
only simple organisms originate. 
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Figure 2: Plot of the maximum C ss recorded as a function of parsimony pressure. PRNG = pseudo random number generator, 
and HAVEGE is the entropy harvester mentioned in the text. 



needed to generate additional complexity. It is also sugges- 
tive that the entropy gatherer generates additional complex- 
ity over the pseudo random number generator, however this 
effect is unlikely to be statistically significant in this data set. 
A different view of the data can be seen in figure[3] where 
the origination of the organism of maximal complexity is 
plotted. The interesting thing here is that the pseudo ran- 
dom number generator took a lot longer to find its maximally 
complex organism than the entropy gatherer algorithm. 

Discussion 

The results reported here are of a small scale pilot study to 
study the effect of parsimony and of random oracles. Obvi- 
ously much work needs to be done to tighten up the method- 
ology of the experiment, and to perform analysis of statisti- 
cal significance on the results. Clearly, this experiment does 
not show evidence of open-ended complexity growth either. 
Nevertheless, these interim results look encouraging. 

Since the Tierra simulations are run in parallel on a Linux 
cluster, and it doesn't matter if one uses the same sequence 
of random numbers in each simulation, it should be possible 
to combine the entropy harvested from all CPUs, as well as 
network latency on the interconnect to substantially increase 
the entropy production of HAVEGE. This will require sub- 
stantial recoding of HAVEGE to exploit this. 

In the history of Earth's biosphere, complexity of indi- 
viduals remain largely static for the first 2 billion years of 
life. It was only with originations of Eukaryotes circa 2Gya 
and and of multicellular life circa 600Mya that we have 
any appreciable jump in complexity over the previous 2 bil- 
lion years of bacterial life. During Phanerozoic (540Mya- 
present), there is little unambiguous evidence of complexity 
growth of organisms! McShea, 1996 1. However, what is a 
very clear trend is growth in the complexity of ecosystems. 
Diversity of the Earth's biosphere appears to have grown ex- 
ponentially since Cambrian times (Ben ton, 200 1) . 

Looking at things another way, a multicellular animal can 
be considered as an ecosystem of eukaryotic cells (OK so 
the genetic code for most of the cells is identical — gut flora 
being the obvious exception), and each eukaryotic cell can 
be considered an ecosystem of bacterial cells (nucleus + or- 
ganelles). If anything, the "parts" of the World's biosphere 
have gotten simpler — it's the network connecting the parts 
that shown the complexity growth. In the Tierra case, what 
we should be looking for is overall complexity of the ecosys- 
tem, not complexity of the individual digital organisms. 

At present, we still don't have a good theory for how to 
measure the complexity of an ecosystem, knowing its food- 
web. Diversity is a lower bound on complexity, but is a 
rather crude indicator of overall complexity. Bedau-Packard 
(Bed au et al., 19 98 1 statistics use diversity as one of the key 
indicators of open-ended evolution. This is probably all that 
is ever likely to be available for the Earth's biosphere, but 
in artificial life systems we can look for better measures of 



ecosystem complexity. 
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Figure 3: Plot of the origination time of the organism with maximal complexity, as a function of parsimony pressure. PRNG : 
pseudo random number generator, and HAVEGE is the entropy harvester mentioned in the text. 
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